Fast Sorting on the Sequent

نویسندگان

  • Simon Kahan
  • Walter L. Ruzzo
چکیده

We develop a series of quicksort algorithms for the Sequent Symmetry shared memory parallel computer. By employing a novel yet simple parallel splitting algorithm and dynamic scheduling we are able to achieve a speedup of 13 with 16 processors over the performance of sequential quicksort on one processor. This work provides experimental evidence that asynchronous algorithms’ more uniform usage of shared resources may make them less susceptible to bottlenecks on real machines. Introduction: On sequential processors, Quicksort is one of the most e cient algorithms for sorting large arrays of data [Hoa62]. Three factors are eminently responsible for its speed: locality of memory references, a tight inner loop, and, on average, a small number of comparisons. Locality ensures both that cache prefetching can be exploited and that paging is minimal. Because a high proportion of the inner loop operations are comparisons, the computational overhead per comparison is small. Even though some other sequential sorting algorithms perform hardly any more, and possibly fewer, comparisons in the worst case than Quicksort performs on the average [Weg90], Quicksort remains the algorithm of choice over a wide range of input sizes due to its e ciency on real machines. Attempts to parallelize Quicksort are evident in the literature of both theory and practice. Some of the relevant theoretical work on parallel Quicksort is based on PRAM [FW78] models of computation in which the number of processors available is unlimited, and access to memory locations are of constant cost independent of address and of the number of processors active. Martel and Gusfield [MG89] and Chlebus and Vrťo [CV91], for example, describe formulations of Quicksort on the PRAM model sorting N distinct keys in O(logN) expected time with N processors. Other PRAM variants of Quicksort are described in Akl [Akl85] and JáJá [JáJ92]. There is nothing intrinsically wrong with such PRAM algorithms, but certain ⇤Max-Planck Institute for Computer Science, Saarbruecken, Germany. [email protected] †Department of Computer Science & Engineering, University of Washington, Seattle, WA. [email protected]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A sorting network in bounded arithmetic

We formalize the construction of Paterson’s variant of the Ajtai–Komlós–Szemerédi sorting network of logarithmic depth in the bounded arithmetical theoryVNC ∗ (an extension of VNC ), under the assumption of existence of suitable expander graphs. We derive a conditional p-simulation of the propositional sequent calculus in the monotone sequent calculus MLK .

متن کامل

A Static Analysis Method for a Classical Linear Logic Programming Language

In this paper, we propose a new static analysis method which is applicable for a classical linear logic programming language. Andreoli et al. proposed a static analysis method for the classical linear logic programming language LO, but their method did not cover multiplicative connectives which are important for a resource-sensitive feature of linear logic. Our method, in contrast, covers multi...

متن کامل

Characteristics of S-jump on Roughened Bed Stilling Basin

Formation of hydraulic jump is necessary for dissipation of the excess kinetic energy downstream of spillways and gates. Hydraulic jump stilling basins of sudden expansion are one of the energy dissipater structures which are used where the available tail water depth is less than the required sequent depths. In the present study to further reduce the hydraulic jump sequent depths ratio, a rough...

متن کامل

A Comparison of Parallel Sorting Algorithms on Different Architectures

In this paper, we present a comparative performance evaluation of three different parallel sorting algorithms: bitonic sort, sample sort, and parallel radix sort. In order to study the interaction between the algorithms and the architecture, we implemented all the algorithms on three different architectures: a MasPar MP1202, a mesh-connected computer with 2048 processing elements; an nCUBE 2, a...

متن کامل

A Non-dominated Sorting Ant Colony Optimization Algorithm Approach to the Bi-objective Multi-vehicle Allocation of Customers to Distribution Centers

Distribution centers (DCs) play important role in maintaining the uninterrupted flow of goods and materials between the manufacturers and their customers.This paper proposes a mathematical model as the bi-objective capacitated multi-vehicle allocation of customers to distribution centers. An evolutionary algorithm named non-dominated sorting ant colony optimization (NSACO) is used as the optimi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013